Abstract:

We present a strategy to quantify the performance of jet definitions in kinematic reconstruction
tasks. It is designed to make use exclusively of physical observables, in contrast to previous
techniques which often used unphysical Monte Carlo partons as a reference. It is furthermore
independent of the detailed shape of the kinematic distributions. We analyse the performance of 5
jet algorithms over a broad range of jet-radii, for sources of quark jets and gluon jets, spanning 
the energy scales of interest at the LHC, both with and without pileup. The results allow one to 
identify optimal jet definitions for the various scenarios. They confirm that the use of a small jet 
radius ($R \sim 0.5$) for quark-induced jets at moderate energy scales, $\mathcal{O}(100\,\mathrm{GeV})$, is a good choice.
However, for gluon jets and in general for TeV scales, there are significant benefits to be had from
using larger radii, up to $R \gtrsim 1$. This has implications for the span of jet-definitions that the LHC
experiments should provide as defaults for searches and other physics analyses. 
1 Introduction

A recurring question in jet studies is "what is the best jet definition for a given specific analysis?"
One approach to answering such a question is to repeat the analysis for a large range of jet definitions
and select the best, whatever this means in the context of the given analysis. This can be rather
time consuming. Furthermore, experiments may not have easy access to a sufficiently large array of
jet definitions: for example, only a handful may be calibrated and included as standard in event
records. It is therefore important to have advance knowledge about the types and the span of jet
definitions that are likely to be optimal, independently of details of specific analyses.

In this paper we investigate the question of identifying optimal jet definitions with the help 
of characterisations of jet-finding "quality" that are designed to be robust and physical, as well 
as reasonably representative of a jet definition's quality for kinematic reconstruction tasks. We 
concentrate on kinematic reconstructions (rather than more QCD-oriented measurements, such as 
the inclusive-jet spectrum), because they are a key element in a wide range of LHC investigations, 
including top-quark studies and new-particle searches. We already presented in [1] a similar, though
less systematic and extensive, investigation. 

The quality of a given jet definition may depend significantly on the process under consideration, 
i.e. how the jets are produced. Here we will examine both quark and gluon-induced jets, spanning 
a range of energies. They will be obtained from Monte Carlo production and decay of fictitious 
narrow Z' and H bosons, with $Z' \to q\bar{q}$ and $H \to gg$. For each generated event we will cluster
the event into jets with about 50 different jet definitions and determine the invariant mass of the
sum of the two hardest jets. The distribution of invariant masses should then have a peak near the
heavy boson mass. We will take the sharpness of that peak to be indicative of the quality of each
jet definition. By scanning a range of Z', H masses we will establish this information for a range
of partonic energies. $Z' \to q\bar{q}$ and $H \to gg$ events are comparatively simple, perhaps overly so;
we will therefore complement them with studies of fully hadronic decays of $t\bar{t}$ events.

Our approach differs crucially from usual investigations of jet-definition quality in that it avoids 
any matching to unphysical Monte Carlo partons, whose relation to the jets can depend as much on 
the details of the Monte Carlo showering algorithm (for example its treatment of recoil) as on the
jet definition. A reflection of this is that in modern tools such as MC@NLO [2] and POWHEG [3],
which include exact NLO corrections, the original Monte Carlo parton does not even exist.

A further issue that we address relates to the measurement of the sharpness of the peak. Past 
approaches have involved, for example, fitting a Gaussian to the peak (see e.g. [4, 5]) and then
using its standard deviation as the quality measure. This method (and related ones) is however quite
unsuited to the range of peak shapes that arise, and we will therefore devise strategies for measuring 
peak-quality independently of the precise peak shape. 

The results presented in the sections below, without pileup in section 3 and with pileup in
section 4, are complemented by an interactive web-site [6], which collects a far broader range of
plots than can be shown here.

2 Analysis chain 

2.1 Event generation 

We consider the following processes in pp collisions at 14 TeV centre of mass energy: $q\bar{q} \to Z' \to q\bar{q}$,
as a source of quark jets with well-defined energies, for values of $M_{Z'}$ from 100 GeV to 4 TeV;
$gg \to H \to gg$ as a source of gluon jets (as done also by Büge et al. in p]), in a similar mass
range; and fully hadronic $t\bar{t}$ events with $M_t = 175$ GeV and $M_W = 80.4$ GeV, as an example of a
more complex environment.

For the Z' and H samples, the heavy-boson width has been set to a (fictitious) value of less
than 1 GeV so as to produce a $\delta$-like peak for ideal mass reconstruction, or equivalently so as to
provide a mono-energetic source of jets. One should be aware also that the span of masses used
here does not correspond to a physically sensible range for real Higgs or Z' bosons. This is not an
issue, insofar as we are only interested in the Higgs and Z' as well-defined sources of quarks and
gluons in Monte Carlo studies. To emphasise this fact, in what follows we shall refer simply to the
"$q\bar{q}$" and "$gg$" processes.

All the samples have been generated with Pythia 6.410 [7] with the DWT tune [8]. For the $t\bar{t}$
samples the B mesons have been kept stable.

2.2 Jet definitions 

A jet definition [1] is the combination of a jet algorithm, its parameters (e.g. the radius R) and
choice of recombination scheme. It fully specifies a mapping from particles to jets. 
We will study the following infrared and collinear-safe jet algorithms: 

1. The longitudinally invariant inclusive kt algorithm [9, 10, 11], a sequential-recombination
algorithm whose distance measure is the relative transverse momentum between particles.

2. The Cambridge/Aachen (C/A) algorithm [12, 13], also a sequential-recombination algorithm,
which uses the rapidity-azimuth separation between particles as its distance measure.

3. The anti-kt algorithm [14], yet another sequential-recombination algorithm, with the property
that it produces conical jets (akin to an iterative cone algorithm with progressive removal,
such as the current CMS cone algorithm, but without the corresponding collinear unsafety
issues).

4. SISCone [15], a seedless-cone type algorithm with a split-merge step, whose overlap threshold
is set to f = 0.75.^1 Additionally, we use the default choices of an infinite number of passes
and no pT-cut on stable cones.

5. C/A with filtering (see below). 
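
To make the difference between the three sequential-recombination algorithms concrete, the sketch below writes their distance measures in the generalised-kt form that is commonly quoted in the literature, with an exponent p equal to 1 for kt, 0 for C/A and -1 for anti-kt. This parametrisation, the function names and the (pt, y, phi) tuple layout are assumptions made for illustration; they are not taken from this text or from the FastJet interface.

```python
import math

def delta_r2(a, b):
    """Squared rapidity-azimuth separation of two particles given as (pt, y, phi)."""
    dphi = abs(a[2] - b[2])
    if dphi > math.pi:              # wrap the azimuthal difference into [0, pi]
        dphi = 2.0 * math.pi - dphi
    return (a[1] - b[1]) ** 2 + dphi ** 2

def pair_distance(a, b, R, p):
    """Generalised-kt pair distance (assumed form): p = 1 kt, p = 0 C/A, p = -1 anti-kt."""
    return min(a[0] ** (2 * p), b[0] ** (2 * p)) * delta_r2(a, b) / R ** 2

def beam_distance(a, p):
    """Particle-beam distance in the same assumed parametrisation."""
    return a[0] ** (2 * p)
```

With p = 0 the pair distance depends only on the rapidity-azimuth separation, as stated above for C/A, while with p = -1 pairs involving a hard particle have the smallest distances and therefore cluster first.
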

In each case, we will add four-momenta using E-scheme (4-vector) recombination. The algorithms 
all have a parameter R, the jet radius, which controls the opening angle of the jets in the rapidity- 
azimuth plane. Results will be quite sensitive to the choice of R and we will vary it over a suitable 
range for each process. 

In the case of the C/A algorithm, whose clustering sequence is ordered in rapidity-azimuth
distance (~ emission angle), we will also consider the impact of a filtering procedure [17] in which,
subsequent to the jet finding, each jet is unclustered down to subjets at angular scale $x_{\rm filt} R$ and one
retains only the $n_{\rm filt}$ hardest of the subjets. We use $x_{\rm filt} = 0.5$ and $n_{\rm filt} = 2$. Filtering is designed to
limit sensitivity to the underlying event while retaining the bulk of perturbative radiation. It is a
new technique and our aim is not to investigate it in depth (for instance by also varying $x_{\rm filt}$ and
$n_{\rm filt}$), but rather to examine its potential beyond its original context.
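
The final step of filtering can be sketched as follows. The snippet assumes that the subjets at angular scale $x_{\rm filt} R$ have already been obtained by unclustering the jet (that step needs the full C/A clustering sequence and is not reproduced here), and that each subjet is a four-momentum tuple (E, px, py, pz). The function names are hypothetical.

```python
import math

def pt(p):
    """Transverse momentum of a four-momentum (E, px, py, pz)."""
    return math.hypot(p[1], p[2])

def filter_jet(subjets, n_filt=2):
    """Keep the n_filt hardest subjets and add their four-momenta (E-scheme)."""
    hardest = sorted(subjets, key=pt, reverse=True)[:n_filt]
    return tuple(sum(component) for component in zip(*hardest))
```

For example, with three subjets of transverse momenta 50, 30 and 2 GeV and n_filt = 2, the softest subjet is dropped and the filtered jet is the sum of the other two.
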

All the jet algorithms have been used in the implementations and/or plugins of the FastJet
package [18, 19], version 2.3, with the exception of C/A with filtering, which will be made public
in a forthcoming FastJet release.

2.3 Event selection and analysis 

For each event in the $q\bar{q}$ and $gg$ processes, the reconstruction procedure is the following:

1. Carry out the jet finding using all final-state particles, taking the definition of hadron level
proposed as standard in [1].

2. Keep only events in which the two hardest jets have $p_T > 10$ GeV, $|y| < 5$ and rapidity
difference $|\Delta y| < 1$ (the last of these conditions ensures that the corresponding hard partons
cover a limited range of transverse momenta, close to M/2).

3. Reconstruct the invariant mass of the two hardest jets.
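
The dijet selection above can be sketched in a few lines of Python. The snippet assumes the jets are already available as four-momentum tuples (E, px, py, pz) in GeV; the jet finding itself is not reproduced, and the function names are hypothetical.

```python
import math

def pt(p):
    """Transverse momentum of a four-momentum (E, px, py, pz)."""
    return math.hypot(p[1], p[2])

def rapidity(p):
    """Rapidity y = 0.5 * ln((E + pz) / (E - pz)); requires E > |pz|."""
    return 0.5 * math.log((p[0] + p[3]) / (p[0] - p[3]))

def inv_mass(a, b):
    """Invariant mass of the E-scheme sum of two four-momenta."""
    e, px, py, pz = (a[i] + b[i] for i in range(4))
    return math.sqrt(max(e * e - px * px - py * py - pz * pz, 0.0))

def dijet_mass(jets, pt_min=10.0, y_max=5.0, dy_max=1.0):
    """Return the mass of the two hardest jets, or None if the event fails the cuts."""
    if len(jets) < 2:
        return None
    j1, j2 = sorted(jets, key=pt, reverse=True)[:2]
    if min(pt(j1), pt(j2)) <= pt_min:
        return None
    y1, y2 = rapidity(j1), rapidity(j2)
    if max(abs(y1), abs(y2)) >= y_max or abs(y1 - y2) >= dy_max:
        return None
    return inv_mass(j1, j2)
```

Two back-to-back massless jets of 50 GeV each, for instance, pass the cuts and reconstruct to a dijet mass of 100 GeV.
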

For the fully hadronic $t\bar{t}$ process:

1. Carry out the jet finding as above.

2. Keep only events in which the 6 hardest jets have $p_T > 10$ GeV and $|y| < 5$, and of which
exactly two are b-tagged (i.e. contain one or more B-hadrons).

3. Using the four non b-tagged jets, consider the 3 possible groupings into two pairs (i.e. two
candidate W-bosons). For each grouping, calculate the invariant mass of each pair of jets and
keep the grouping that minimises $(m_{i_1 i_2} - M_W)^2 + (m_{i_3 i_4} - M_W)^2$.

4. Reconstruct the invariant masses for the two top quarks by pairing the b and W jets. The
ambiguity in the bW pairing is resolved by taking the solution that minimises the mass
difference between the two candidate top quarks.
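
Steps 3 and 4 amount to two small combinatorial minimisations, which can be sketched as below. The jets are assumed to be four-momentum tuples (E, px, py, pz) in GeV, and the function names are hypothetical; only the value $M_W = 80.4$ GeV is taken from the text.

```python
import math

M_W = 80.4  # GeV, the W mass used in the event generation

def inv_mass(*momenta):
    """Invariant mass of the E-scheme sum of any number of four-momenta."""
    e, px, py, pz = (sum(p[i] for p in momenta) for i in range(4))
    return math.sqrt(max(e * e - px * px - py * py - pz * pz, 0.0))

def best_w_grouping(light_jets, m_w=M_W):
    """Step 3: choose, among the 3 groupings of four jets into two pairs,
    the one minimising the summed squared deviation of the pair masses from m_w."""
    a, b, c, d = light_jets
    groupings = [((a, b), (c, d)), ((a, c), (b, d)), ((a, d), (b, c))]
    def cost(grouping):
        return sum((inv_mass(*pair) - m_w) ** 2 for pair in grouping)
    return min(groupings, key=cost)

def best_top_pairing(b_jets, w_pairs):
    """Step 4: pair each b-jet with a W candidate so that the two
    top-candidate masses are as close to each other as possible."""
    (b1, b2), (w1, w2) = b_jets, w_pairs
    options = [((b1, w1), (b2, w2)), ((b1, w2), (b2, w1))]
    def mass_difference(option):
        (ba, wa), (bb, wb) = option
        return abs(inv_mass(ba, *wa) - inv_mass(bb, *wb))
    return min(options, key=mass_difference)
```

The first function returns the chosen pair of jet pairs; the second returns two (b-jet, W-candidate) pairs from which the two top-quark masses follow via `inv_mass`.
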

^1 The value of f has been chosen to avoid the "monster-jets" [16] that can appear with a previously common default
of f = 0.5.
Figure 1: The dijet mass distribution for $q\bar{q}$ events with $M_{q\bar{q}} = 100$ GeV, compared to a Gaussian fitted for
reconstructed dijet masses between 75 and 125 GeV, for two jet definitions: Cambridge/Aachen with R = 0.3
(left) and R = 0.7 (right).



Note that we do not pass the events through any form of detector simulation. However, the choices
of quality measures that we use to study the invariant-mass distributions will, in part, take into
account known detector resolutions.



2.4 Figures of merit 

The above procedure will give us invariant mass distributions for each $q\bar{q}$ (i.e. Z') and $gg$ (H)
mass (and for W and top in the $t\bar{t}$ sample) and for each jet definition. We next need to establish
a systematic procedure for measuring the peak quality in each distribution. One option would be
to use a Gaussian fit to the peak. Figure 1 illustrates the main difficulty with this option, i.e. that
invariant mass peaks are anything but Gaussian. One might also consider using the variance of 
the invariant mass distribution. This, however, fares poorly in reflecting the quality of the peak 
because of a large sensitivity to the long tails of any distribution. We therefore need to devise 
peak-quality measures that are independent of any specific functional parametrisation, and truly 
reflect the nature of the peak itself. 

Our logic will be the following. Given two peaks, if they both have similar widths then it is the 
taller one that is better; if they have similar numbers of events then it is the narrower one that is 
better. These considerations lead us to define two possible measures:

1. $Q^{w}_{f=z}$: the width of the smallest (reconstructed) mass window that contains a fraction f = z
of the generated massive objects,^2 that is

$$ f \equiv \left( \frac{\#\ \text{reco. massive objects in window of width } w}{\text{Total } \#\ \text{generated massive objects}} \right) = z . \tag{1} $$

A jet definition that is more effective in reconstructing the majority of massive objects within
a narrow mass peak gives a lower value for $Q^{w}_{f=z}$. Therefore smaller values of $Q^{w}_{f=z}$ indicate
"better" jet definitions.

^2 The number of generated massive objects can differ from the total number of events. For example, if in the $t\bar{t}$
samples we have $N_{\rm ev} = 10^5$, the number of generated W bosons (and top quarks) is $N_W = 2 \cdot 10^5$.
Process                  # Gen. events   # Acc. events   Fraction acc. vs. gen.   z in eq. (1)
$Z' \to q\bar{q}$        50 000          ~ 23 000        ~ 0.46                   0.12
$H \to gg$               50 000          ~ 27 000        ~ 0.54                   0.13
Hadronic $t\bar{t}$      100 000         ~ 75 000        ~ 0.75                   0.18

Table 1: Number of generated events, and those accepted after event selection cuts, together with the
fraction of generated events that this corresponds to, and finally the value of z used in eq. (1) (chosen to be
roughly 1/4 of the previous column).

Note that we normalise to the total number of generated objects rather than the (smaller) 
number of objects corresponding to the events that pass the selection cuts. This ensures that 
we do not favour a jet definition for which only an anomalously small fraction of events pass 
the selection cuts (as can happen in the $t\bar{t}$ events for large R, where jets are often spuriously
merged), even if the jet definition (JD) gives good kinematic reconstruction on that small fraction.

The value of z will be chosen, separately for each process, so that with a typical JD the 
window contains about 25% of the massive objects in the events that pass the cuts. The 
values used for z are listed in table 1.

2. $Q^{1/f}_{w=x\sqrt{M}}$: to compute this quality measure, we take a window of fixed width w and slide it
over the mass distribution so as to maximise its contents. Then the figure of merit is given
by

$$ Q^{1/f}_{w=x\sqrt{M}} \equiv \left( \frac{\text{Max } \#\ \text{reco. massive objects in window of width } w = x\sqrt{M}}{\text{Total } \#\ \text{generated massive objects}} \right)^{-1} , \tag{2} $$

where the inverse has been taken so that a better jet definition leads to a smaller $Q^{1/f}_{w=x\sqrt{M}}$,
as above. We set the width equal to $x\sqrt{M}$, where M is the nominal heavy object mass and
x a constant to be chosen. This reflects the characteristic energy-dependence of resolution in
hadronic calorimeters. We take $x = 1.25\,\sqrt{\mathrm{GeV}}$, a value that is in the ballpark of currently
quoted resolutions for the CMS and ATLAS experiments. The reader should be aware that
this choice is associated with a degree of arbitrariness.
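
Both figures of merit can be evaluated directly on the unbinned list of reconstructed masses, which is why they do not depend on any binning. The following is a minimal sketch under stated assumptions: the function names are hypothetical, `masses` holds the reconstructed masses (in GeV) of the objects in events passing the cuts, and the number of objects required in the window is rounded up to the next integer, a choice the text does not specify.

```python
import math

def q_width(masses, n_generated, z):
    """Width of the smallest mass window holding a fraction z of the
    generated objects (first measure). Returns inf if too few objects exist."""
    k = math.ceil(z * n_generated - 1e-9)   # objects needed (rounding up is an assumption)
    m = sorted(masses)
    if k < 1 or k > len(m):
        return math.inf
    return min(m[i + k - 1] - m[i] for i in range(len(m) - k + 1))

def q_inv_fraction(masses, n_generated, mass_nominal, x=1.25):
    """Inverse of the largest fraction of generated objects found in a sliding
    window of fixed width w = x * sqrt(mass_nominal) (second measure)."""
    w = x * math.sqrt(mass_nominal)
    m = sorted(masses)
    best, lo = 0, 0
    for hi in range(len(m)):
        while m[hi] - m[lo] > w:            # shrink the window from the left
            lo += 1
        best = max(best, hi - lo + 1)
    return n_generated / best if best else math.inf
```

In both cases the normalisation is to `n_generated`, the number of generated massive objects, rather than to the number of accepted ones, following the prescription above.
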

In tests of a range of possible quality measures for mass reconstructions (including Gaussian 
fits, and the width at half peak height), the above two choices have been found to be the least 
sensitive to the precise shape of the reconstructed mass distribution, and have the advantage of 
being independent of the binning of the distribution. Another encouraging feature, which will be 
seen below, is that the two measures both lead to similar conclusions on the optimal algorithms 
and R values. 

2.5 Quantitative interpretation of figures of merit 

It is useful to establish a relation between variations of the above quality measures and the
corresponding variation of integrated luminosity needed to maintain constant significance for a signal
relative to background. This will allow us to quantify the importance of any differences between
jet definitions (JD) and the potential gain to be had in using the optimal one. The relation will
be relevant in the case in which the intrinsic width of the physical resonance that one is trying to
reconstruct is no larger than our (narrow-resonance based) reconstructed dijet-peak; for a very
broad physical resonance, the jet-reconstruction quality instead becomes irrelevant.

Our relation will be valid for two background scenarios: one in which the background is flat 
and independent of the jet definition; and another in which the background is not necessarily flat, 
but the signal peak and the background shift together as one changes the jet definition (and the 
second derivative of the background distribution is not too large). For both scenarios, in a window 
centred on the signal peak, the number of background events will be proportional to the window 
width, and the constant of proportionality will be independent of the jet definition. 

The significance of a signal with respect to the background,

$$ \Sigma(\mathrm{JD}) \equiv \frac{N^{\rm JD}_{\rm signal}}{\sqrt{N^{\rm JD}_{\rm bkgd}}} , \tag{3} $$

where $N_{\rm signal}$ and $N_{\rm bkgd}$ are respectively the number of signal and background events, can then be
rewritten as

$$ \Sigma(\mathrm{JD}) = \frac{N^{\rm JD}_{\rm signal}}{\sqrt{C\, w^{\rm JD}}} , \tag{4} $$

where $w^{\rm JD}$ is the width of the window in which we count signal and background events. The
argument JD serves as a reminder that the significance will in general depend on the jet definition,
and C is a constant independent of the jet definition thanks to our assumptions above on the
structure of the background.

We can now establish the following relations between ratios of quality measures for two jet 
definitions and corresponding ratios of significance. The latter will then relate directly to ratios of 
luminosities needed to achieve the same significance. 

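• In the case of $Q^{w}_{f=z}$ the number of signal events is kept fixed and the window width depends
on the jet definition. We have then

$$ \frac{\Sigma(\mathrm{JD}_1)}{\Sigma(\mathrm{JD}_2)} = \left[ \frac{N^{\rm JD_2}_{\rm bkgd}}{N^{\rm JD_1}_{\rm bkgd}} \right]^{1/2} = \left[ \frac{w^{\rm JD_2}}{w^{\rm JD_1}} \right]^{1/2} = \left[ \frac{Q^{w}_{f=z}(\mathrm{JD}_2)}{Q^{w}_{f=z}(\mathrm{JD}_1)} \right]^{1/2} . \tag{5} $$

• In the case of $Q^{1/f}_{w=x\sqrt{M}}$ the window width is kept constant and it is instead the number
of signal events in the window that depends on the jet definition. Hence

$$ \frac{\Sigma(\mathrm{JD}_1)}{\Sigma(\mathrm{JD}_2)} = \frac{N^{\rm JD_1}_{\rm signal}}{N^{\rm JD_2}_{\rm signal}} = \frac{Q^{1/f}_{w=x\sqrt{M}}(\mathrm{JD}_2)}{Q^{1/f}_{w=x\sqrt{M}}(\mathrm{JD}_1)} . \tag{6} $$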

Both of these expressions are consistent with the statement that a larger value of a quality
measure indicates a worse jet definition (i.e. the significance is smaller at fixed integrated luminosity
$\mathcal{L}$). This in turn implies that a larger integrated luminosity will be needed to obtain a given fixed
significance. It is convenient to express this in terms of an effective luminosity ratio,

$$ \rho_{\mathcal{L}}(\mathrm{JD}_2/\mathrm{JD}_1) = \frac{\mathcal{L}(\text{needed with } \mathrm{JD}_2)}{\mathcal{L}(\text{needed with } \mathrm{JD}_1)} = \left[ \frac{\Sigma(\mathrm{JD}_1)}{\Sigma(\mathrm{JD}_2)} \right]^{2} . \tag{7} $$


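Given a certain signal significance with JD1, $\rho_{\mathcal{L}}(\mathrm{JD}_2/\mathrm{JD}_1)$ indicates the factor by which more luminosity is
needed to obtain the same significance with JD2.^3 The expressions for $\rho_{\mathcal{L}}$ in terms of the two
quality measures are

$$ \rho_{\mathcal{L}}(\mathrm{JD}_2/\mathrm{JD}_1) = \frac{Q^{w}_{f=z}(\mathrm{JD}_2)}{Q^{w}_{f=z}(\mathrm{JD}_1)} \tag{8} $$

and

$$ \rho_{\mathcal{L}}(\mathrm{JD}_2/\mathrm{JD}_1) = \left[ \frac{Q^{1/f}_{w=x\sqrt{M}}(\mathrm{JD}_2)}{Q^{1/f}_{w=x\sqrt{M}}(\mathrm{JD}_1)} \right]^{2} . \tag{9} $$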



A non-trivial check will be that the luminosity ratios obtained with these two different expressions 
are consistent with each other. We shall see below that this is generally the case. 
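
As a small numerical illustration of how the quality measures translate into luminosity, the sketch below encodes the two relations that follow from a significance proportional to the number of signal events divided by the square root of the window width: the luminosity ratio is the ratio of window widths for the minimal-width measure, and the square of the ratio for the inverse-fraction measure. The function names are hypothetical.

```python
import math

def rho_from_width(q_jd2, q_jd1):
    """Luminosity ratio from the minimal-width measure: ratio of window widths."""
    return q_jd2 / q_jd1

def rho_from_inv_fraction(q_jd2, q_jd1):
    """Luminosity ratio from the inverse-fraction measure: squared ratio."""
    return (q_jd2 / q_jd1) ** 2

def significance_gain(rho):
    """At fixed luminosity, factor gained in significance by using JD1 instead of JD2."""
    return math.sqrt(rho)
```

For instance, a jet definition whose minimal window is 20% wider than that of the reference definition needs about 20% more integrated luminosity for the same significance, whereas a 20% larger inverse-fraction measure corresponds to about 44% more.
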



3 Results without pileup 

Let us start by illustrating the quality measures of section 2.4 for two examples of the processes
discussed in section 2.1. Figure 2 shows dijet invariant mass distributions for the 100 GeV $q\bar{q}$
case (upper 6 plots) and the 2 TeV $gg$ case (lower 6 plots). In each case we show 3 different jet
definitions. Together with the histograms, we have included a shaded band that represents the
region used to calculate the quality measures. In the first and third rows we consider $Q^{w}_{f=z}$:
the quality measure is given by the width of the (cyan) band. In the second and fourth rows, the
histograms are the same, but we now show the (dark-green) band used in determining $Q^{1/f}_{w=x\sqrt{M}}$:
the quality measure is given by the total number of generated events, divided by the number of
events contained in the band.

Within a given row of figure 2 (same process, but different jet definitions), the histograms that
"look" best (i.e. the rightmost plots) are also those with the smallest quality measures, as should
be the case. Furthermore in the situations where one histogram looks only moderately better than
another (e.g. top row, central and right plots), the values of the quality measures are appropriately
close. This gives us a degree of confidence that the quality measures devised in the previous section
behave sensibly, and provide a meaningful numerical handle on the otherwise fuzzy concept of
"best-looking".

Before moving on to a more systematic study of how the quality measures depend on the choice
of jet definition, we observe that in figure 2 smaller R gives better results for the 100 GeV $q\bar{q}$ case,
while for the 2 TeV $gg$ case, the opposite happens. This is a concrete illustration of the fact
that there is no universal best jet definition.

Next, in figure 3 we show the values of the two quality measures (left: $Q^{w}_{f=z}$, right: $Q^{1/f}_{w=1.25\sqrt{M}}$)
for different jet algorithms as a function of R. The top and middle rows correspond to the two
processes already studied in figure 2 (100 GeV $q\bar{q}$ and 2 TeV $gg$), while the bottom row corresponds
to top reconstruction in $t\bar{t}$ events. These plots allow one to compare the different jet algorithms,
and for each one to determine the radius value that gives the best quality measure. The curves
confirm the earlier observation that the 2 TeV $gg$ case prefers a substantially larger choice of R than
the 100 GeV $q\bar{q}$ case. To understand this characteristic, it is useful to consider figure 4, which gives
the best R (i.e. position of the minimum of the quality measure for each algorithm) as a function
of momentum scale, separately for the quark and gluon cases. There one sees that for gluonic jets
one prefers a larger R than for quark jets, and one also prefers a larger R as one moves to higher
momentum scales.

^3 Alternatively, for a fixed integrated luminosity, $\sqrt{\rho_{\mathcal{L}}(\mathrm{JD}_2/\mathrm{JD}_1)}$ indicates the extra factor of signal significance
that would be gained with JD1 compared to JD2.

[Figure 2 plot panels not reproduced; the horizontal axis of each panel is the dijet mass in GeV.]

Figure 2: Illustrative dijet invariant mass distributions for two processes (above: $q\bar{q}$ case at M =
100 GeV; below: $gg$ case at M = 2 TeV), comparing three jet definitions for each process. The
shaded bands indicate the regions used when obtaining the two different quality measures. Note
that different values of R have been used for the $q\bar{q}$ and $gg$ cases.

Figure 3: The quality measures $Q^{w}_{f=z}$ (left) and $Q^{1/f}_{w=1.25\sqrt{M}}$ (right) for different jet algorithms as a function
of R, for the 100 GeV $q\bar{q}$ case (top row), 2 TeV $gg$ (middle row) and top reconstruction in $t\bar{t}$ events (bottom
row).

Figure 4: The optimal value for R as a function of the mass of the $q\bar{q}$/$gg$ system (upper/lower rows), as
determined from the two quality measures (left, right columns) for various jet algorithms.


This general pattern was predicted in [20], and is understood in terms of an interplay between the
jet needing to capture perturbative radiation, but without excessive contamination from underlying-event
(UE) "noise": whereas perturbative arguments alone would favour R of order 1, the need to
limit the amount of UE in the jet pushes one to lower R. The UE matters most relative to the jet
energy for low-pt jets, and perturbative radiation matters more for gluon jets.

A further remark is that the optimal values of R found for processes involving ~ 100 GeV mass
scales, R ~ 0.5, correspond quite closely to values used typically by the Tevatron experiments and
in many LHC studies (see e.g. [21, 22]).^4 Our analysis here confirms that those are therefore good
choices. However, at the high scales that will be probed by LHC, $\gtrsim$ 1 TeV, our results indicate that
it is important for the experiments to use jet definitions with somewhat larger values of R.

The quantitative impact of a poor choice of jet definition is illustrated in figure 5. For each
process, we have identified the jet definition, $\mathrm{JD}_{\rm best}$, that provides the best (lowest) value of the
quality measure (cf. table 2). Then for every other jet definition, JD, we have calculated the effective
increase in luminosity, $\rho_{\mathcal{L}}(\mathrm{JD}/\mathrm{JD}_{\rm best})$ as in eq. (7), that is needed to obtain as good a significance
as with $\mathrm{JD}_{\rm best}$. This is shown for each jet algorithm as a function of R, with (red) solid lines for
$Q^{w}_{f=z}$, using eq. (8), and (blue) dashed lines for $Q^{1/f}_{w=1.25\sqrt{M}}$, eq. (9).

^4 For the Tevatron there will actually be a preference for slightly larger R values than at LHC, a consequence of
the more modest UE.

                $Q^{w}_{f=z}$                                  $Q^{1/f}_{w=1.25\sqrt{M}}$
Process         $\mathrm{JD}_{\rm best}$     Q (GeV)           $\mathrm{JD}_{\rm best}$     Q
qq, 100 GeV     SISCone R=0.5                7.38              SISCone R=0.5                5.83
qq, 2 TeV       C/A-filt R=0.9               20.8              SISCone R=1.0                5.18
gg, 100 GeV     SISCone R=0.6                14.7              SISCone R=0.6                8.78
gg, 2 TeV       C/A-filt R=1.3               55.2              C/A-filt R=1.3               7.64
W in tt         anti-kt R=0.4                10.7              anti-kt R=0.4                5.37
t in tt         anti-kt R=0.4                19.9              anti-kt R=0.4                6.44

Table 2: The $\mathrm{JD}_{\rm best}$ jet definitions for the various processes of figure 5 together with the
corresponding $Q(\mathrm{JD}_{\rm best})$ values used in calculating $\rho_{\mathcal{L}}$. In the 2 TeV qq case, the $\mathrm{JD}_{\rm best}$ definitions differ,
but figure 5 shows that they lead to very similar quality measures, and the question of which is
"best" ultimately depends on fine details of their behaviour.

A first observation is that in general, the two quality measures lead to similar results for $\rho_{\mathcal{L}}$.
This is a non-trivial check that the procedure is consistent and that our quality measures behave
sensibly.^5 One should be aware that there is some degree of arbitrariness in the choice of z for $Q^{w}_{f=z}$
and x for $Q^{1/f}_{w=x\sqrt{M}}$. Accordingly, we have also examined results for the case where these choices
are doubled, and verified that $\rho_{\mathcal{L}}$ is again similar (in this respect $Q^{1/f}_{w=x\sqrt{M}}$ seems to be somewhat
more stable, cf. the web-pages at [6]).

Next, let us discuss the impact of using the worst jet algorithm (at its best R) compared to the 
best jet definition. At small energy scales one requires about 10-20% extra luminosity, a modest
effect. At high masses this increases to 30-40%. In general it seems that SISCone and C/A-filt
are the best algorithms (and are similar to each other), while the kt algorithm fares worst. In some 
cases anti-kt also performs optimally. 

The penalty for choosing a non-optimal R can be even larger. For example, using SISCone with
R = 0.4 (0.5) at 2 TeV leads to $\rho_{\mathcal{L}}$ of about 1.75 (1.35) for the $q\bar{q}$ case, and ~ 3 (2) for the $gg$ case.
The use of R ~ 0.4-0.5 is widespread in current LHC analyses (for example, [22] used R = 0.5
with the CMS iterative cone, which is similar to anti-kt) and if this is maintained up to high mass
scales, it may lead to a need for twice as much integrated luminosity (or even more) to make a
discovery as with an optimised choice of jet definition.

A point worth bearing in mind is that the quality measures do not provide all relevant
information about the peak. For example, for the small-mass gluonic case, the smallest R values do not
lead to appreciably worse-than-optimal $\rho_{\mathcal{L}}$ results; however, if one examines the position of the peak
(cf. the histograms available via the web-tool [6]) one sees that it becomes rather unstable at small
R [23].^6 Nevertheless, in most cases, and in particular for all but a few pathological regions of the
jet-definition parameter-space, the position of the peak is sensible.

So far we have discussed only simple, dijet events. The last row of figure 3 and the last two rows
of figure 5 show results for more complex $t\bar{t}$ events, which here decay to 6 jets. A first observation is
that the optimal R is fairly similar to that in the 100 GeV $q\bar{q}$ case; this is perhaps not surprising,
since in both cases the energy of each jet is around 50 GeV. More detailed inspection shows however
that the range of "acceptable" R is significantly smaller for the $t\bar{t}$ case than the $q\bar{q}$ case. This is
most visible in figure 5, and especially for the SISCone algorithm. The reason is simple: in multijet
events, as R is increased, jets that should represent distinct leading-order "partons" may end up
being merged.^7

^5 In a few instances there are moderate differences between the two determinations of $\rho_{\mathcal{L}}$. This usually occurs when
the resulting window widths for the two measures differ substantially (i.e. they probe the distribution with different
effective resolutions). These cases, however, do not significantly alter any conclusions.

^6 This instability is the cause also of the (somewhat spurious) decrease of $\rho_{\mathcal{L}}$ at small R in the 100 GeV gg C/A-filt
case: there, the peak is at quite low masses (~ 50 GeV), and the limited phase-space towards yet-smaller masses
causes it to narrow slightly as one further reduces R. Cases like this are rare and easy to identify when they occur.

[Figure 5 plot panels not reproduced; each column corresponds to one jet algorithm (the surviving column labels read
C/A, anti-kt, SISCone and C/A-filt) and the horizontal axis of each panel is R.]

Figure 5: For each process (one per row) this plot shows the luminosity ratio $\rho_{\mathcal{L}}$ required in order
to obtain the same significance as with the best jet definition. The (red) solid line corresponds to
the estimate of $\rho_{\mathcal{L}}$ from eq. (8) (based on the minimal width $Q^{w}_{f=z}$), while the (blue) dotted line
corresponds to eq. (9) (based on the maximal fraction $Q^{1/f}_{w=x\sqrt{M}}$).

This issue should be kept in mind when planning analyses involving multijet final states at high 
energy scales: with current algorithms, there will then be a significant tension between the need for
a large radius owing to the high energy scale, and the need for a small radius in order to disentangle the
many jets. In this respect, methods that use parameters other than just R to resolve jet structure 
(including some originally intended for jet substructure), such as [24, 25, 26, 27, 28, 29], are
likely to be of prime importance in obtaining optimal jet results. The detailed general investigation 
of such "third-generation" jet-methods^8 deserves further work.

A final comment concerns studies to reconstruct boosted top quarks, for example from high-mass
resonances that decay to $t\bar{t}$ (see e.g. [30, 31, 32]). Significant recent work has gone into investigating
the identification of top quarks in this context [331 [23 [23 [221 [Mj , where their decay products are 
often contained within a single jet. In such a situation, the best R-value for carrying out a top-ID
subjet analysis depends on the top pt, as in [28] (similar to the C/A-based subjet Higgs search of
[17]), and is conditioned by the need to take a jet opening angle commensurate with the top-quark
dead-cone size and decay angle, $\mathcal{O}(2 m_t/p_t)$ ($p_t \sim M/2$ where M is the resonance mass), so that
the jet contains the top decay products, but not gluon emission from the top quark itself (which 
would smear and skew the top mass reconstruction). However, to obtain a good mass-resolution on 
the high-mass resonance that is reconstructed from the $t\bar{t}$ system, it is instead necessary to include
any gluon radiation from the top quark inside the jets, and for this purpose, optimal R values will
be the larger ones found here for normal $q\bar{q}$ dijet-events and will grow with pt (this issue obviously
does not arise for boosted electroweak bosons). Thus, and in contrast to what has been investigated
so far in the work referred to above, one should work simultaneously with two R values: a small
one, $\mathcal{O}(2 m_t/p_t)$, for identifying the top-quark decays, and a larger one, as given by figure 4, for
determining the top-quark momentum just as it was produced from the resonance decay. In this
respect C/A-based solutions (including filtering) are particularly interesting insofar as they allow
consistent views of an event at multiple R-values.

4 Results with pileup 

It is foreseen that the LHC will operate at a range of different luminosities. One should therefore 
establish whether the conclusions of the previous section are robust in the presence of multiple 
minimum-bias (MB) pileup (PU) events. We will consider low and high luminosity scenarios: 
$\mathcal{L}_{\mathrm{low}} = 0.05\,\mathrm{mb}^{-1}$ and $\mathcal{L}_{\mathrm{high}} = 0.25\,\mathrm{mb}^{-1}$ per bunch crossing,^9 corresponding 
respectively to an average of about 5 and 25 minimum-bias collisions per bunch crossing (or 
instantaneous luminosities of $2 \times 10^{33}\,\mathrm{cm}^{-2}\,\mathrm{s}^{-1}$ and $10^{34}\,\mathrm{cm}^{-2}\,\mathrm{s}^{-1}$ with a 25 ns bunch spacing). 
The MB events are simulated using Pythia 6.410 [7] with the DWT tune [3, as for our hard event 
samples, and the number of MB events added to a specific hard event has a Poisson distribution. 

^7 SISCone's higher sensitivity to this effect is a consequence of the fact that for a given R value it can cluster two 
hard particles that are up to 2R apart, whereas the other algorithms reach out only to R. 

^8 We use the term "first generation" jet-methods for the infrared and/or collinear unsafe cone algorithms of the 
'80s and '90s, "second generation" for subsequent infrared and collinear safe methods (recombination algorithms like 
JADE, $k_t$, C/A and anti-$k_t$, as well as the cone algorithm SISCone) that essentially use one main fixed parameter to 
specify the resolution for jets. By third generation methods, we have in mind those that exploit more than a single 
"view" of the event, and which may be based on a more powerful use of existing algorithms. 

^9 Even larger luminosities, $\mathcal{L}_{\mathrm{vhigh}} \simeq 2.5\,\mathrm{mb}^{-1}$, might be relevant at the sLHC upgrade (see for example [35]). 
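
The correspondence between the quoted luminosities per bunch crossing, the instantaneous 
luminosities and the average number of MB collisions can be checked with a few lines of Python. 
This is only a sketch: the minimum-bias cross section of 100 mb is an assumption inferred from the 
quoted averages (0.05 mb^-1 giving about 5 collisions), and the function names are hypothetical. 
The Poisson sampler illustrates how the number of MB events per hard event might be drawn; it is 
not the code used for the study. 

```python
import math
import random

MB_TO_CM2 = 1.0e-27        # 1 mb expressed in cm^2
BUNCH_SPACING_S = 25.0e-9  # 25 ns bunch spacing, as quoted in the text
SIGMA_MINBIAS_MB = 100.0   # ASSUMPTION: effective minimum-bias cross section in mb


def instantaneous_luminosity(lumi_per_crossing_inv_mb):
    """Convert a luminosity in mb^-1 per bunch crossing to cm^-2 s^-1."""
    return lumi_per_crossing_inv_mb / MB_TO_CM2 / BUNCH_SPACING_S


def mean_pileup(lumi_per_crossing_inv_mb, sigma_mb=SIGMA_MINBIAS_MB):
    """Average number of minimum-bias collisions per bunch crossing."""
    return lumi_per_crossing_inv_mb * sigma_mb


def sample_poisson(mean, rng):
    """Draw one Poisson-distributed integer (Knuth's multiplication method)."""
    threshold = math.exp(-mean)
    count = 0
    product = rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


if __name__ == "__main__":
    rng = random.Random(2008)
    for label, lumi in (("low", 0.05), ("high", 0.25)):
        n_mb = [sample_poisson(mean_pileup(lumi), rng) for _ in range(5)]
        print(label, f"{instantaneous_luminosity(lumi):.1e} cm^-2 s^-1", n_mb)
```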

It is well known that PU degrades mass resolution and shifts the energy scale, and the LHC 
experiments will attempt to correct for this. Currently their procedures tend to be highly 
detector-specific, which limits their applicability in a generic study such as ours. We will therefore use 
the jet-area-based pileup subtraction method of [36, 16], which is experiment-independent and 
straightforward to use within the FastJet framework.^10 For completeness we give below a quick 
review of area-based subtraction, and then we will examine the impact of pileup both with and 
without its subtraction. 



4.1 Area-based pileup subtraction 

Area-based subtraction involves two elements: the calculation of jet areas, which represent a given 
jet's susceptibility to contamination from uniform background noise, and an estimate of the level 
of background noise in the event. 

As proposed in [16], for each jet j in an event, one can determine a 4-vector area in the 
rapidity-azimuth plane, $A_{\mu j}$. Given an estimate for the amount $\rho$ of transverse momentum per unit 
area due to background noise, a jet's corrected (i.e. subtracted) momentum $p_{\mu j}^{(\mathrm{sub})}$ is obtained as [36] 

$$p_{\mu j}^{(\mathrm{sub})} = p_{\mu j} - A_{\mu j}\,\rho . \qquad (10)$$

Subtracted jets are then used both for event selection and for the mass reconstructions. 

The quantity $\rho$ is taken to be independent of rapidity (an acceptable approximation in much of 
the detector), and calculated on an event-by-event basis as 

$$\rho = \mathrm{median}_{j}\left[\frac{p_{tj}}{A_{tj}}\right] ,$$

where the median is obtained over all jets with $|y| < 5$.^11 Regardless of the jet-definition used 
to analyse the hard event, $\rho$ is always calculated using jets obtained with the $k_t$ algorithm with 
R = 0.5, a choice that was found to be particularly robust in [36].^12 
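
The two steps of area-based subtraction, a median estimate of $\rho$ followed by the 4-vector 
correction of eq. (10), can be sketched in plain Python as follows. This is a minimal illustration 
under stated assumptions and not the FastJet implementation: jets are represented by hypothetical 
dictionaries holding a momentum and an area 4-vector, the area 4-vectors are approximated as 
massless, and all function names are invented for the example. 

```python
import math
from statistics import median


def four_vector(pt, y, phi):
    """Massless four-vector (px, py, pz, E) from transverse size, rapidity and azimuth."""
    return (pt * math.cos(phi), pt * math.sin(phi), pt * math.sinh(y), pt * math.cosh(y))


def transverse(v):
    """Transverse component of a four-vector (p_t for a momentum, A_t for an area)."""
    return math.hypot(v[0], v[1])


def rapidity(v):
    """Rapidity y = 0.5 * ln((E + pz) / (E - pz))."""
    return 0.5 * math.log((v[3] + v[2]) / (v[3] - v[2]))


def estimate_rho(jets, max_abs_rapidity=5.0):
    """Median of p_t / A_t over all jets with |y| below the cut."""
    ratios = [
        transverse(jet["p"]) / transverse(jet["area"])
        for jet in jets
        if abs(rapidity(jet["p"])) < max_abs_rapidity
    ]
    if not ratios:
        raise ValueError("no jets inside the rapidity range")
    return median(ratios)


def subtract(jet, rho):
    """Corrected four-momentum p - rho * A, applied component by component."""
    return tuple(p - rho * a for p, a in zip(jet["p"], jet["area"]))


if __name__ == "__main__":
    # Toy numbers, invented purely for illustration.
    background_jets = [
        {"p": four_vector(2.0, 0.0, 0.0), "area": four_vector(1.0, 0.0, 0.0)},
        {"p": four_vector(5.0, 1.0, 2.0), "area": four_vector(0.5, 1.0, 2.0)},
        {"p": four_vector(3.2, -2.0, 4.0), "area": four_vector(0.8, -2.0, 4.0)},
    ]
    hard_jet = {"p": four_vector(100.0, 0.3, 1.0), "area": four_vector(0.8, 0.3, 1.0)}
    rho = estimate_rho(background_jets)
    print("rho =", rho, " subtracted p_t =", transverse(subtract(hard_jet, rho)))
```

In the study itself the jets entering the median are $k_t$ jets with R = 0.5, whereas the jets being 
corrected come from whichever jet definition is under test; the sketch keeps the two collections 
separate for that reason. 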



4.2 Results 

Jets are most strongly affected by pileup at large R values. Accordingly, in figure [6] we show 
histograms for the 2 TeV gg process, which without pileup favoured R > 1. The upper row shows 
results with no pileup, low and high pileup, all without subtraction. One sees the clear degradation 
in the quality of the peak as pileup is added, and this is reflected in the increasing values of the 
quality measure. There is additionally a shift of the peak to higher masses. 

^10 Should the experiments' internal methods prove to be superior to the jet-area-based method, then the conclusions 
of this section will only be made more robust; if on the other hand they turn out to perform less well, there would be 
a compelling reason for them to adopt the area-based method. 

^11 The details of the peak position after subtraction (not the main subject of our study here) can depend on the 
choice of jets used for calculating $\rho$, with the issue being largest for large R. As an example, in the 2 TeV gg case 
with R = 1, the residual impact on the peak position can be of order 10 GeV. 

^12 Other technical settings are as follows: we use the active area for all jet algorithms except SISCone, which for 
speed reasons we use with the passive area. The area of ghost particles is set to 0.01, they cover the range $|y| < 6$, and 
the repeat parameter is 1. All other parameters correspond to the defaults in FastJet 2.3. For C/A with filtering, 
the subtraction is carried out before the filtering stage. 

Figure 6: Invariant mass distributions for the 2 TeV gg process, for the $k_t$ algorithm with R = 1, 
shown with no pileup (left), low pileup (middle) and high pileup (right), without subtraction (upper 
row) and with subtraction (lower row). The shaded bands indicate the region used to calculate the 
$Q^{w}_{f=z}$ quality measure in each case. 

Figure 7: Illustration of the impact of subtraction in the absence of pileup. The effective luminosity 
ratios are based on the $Q^{w}_{f=z}$ measure, and normalised to the result for the best jet definition without 
pileup (no subtraction). The (red) solid curves show results without subtraction, the (blue) dotted 
curves with subtraction. 

The lower row of figure [6] shows the corresponding results with subtraction. Note that we have 
applied subtraction even in the case without pileup and one observes a non-negligible improvement 
in the peak quality. This is because of the contribution to the "noisiness" of the event from 
"underlying-event" (UE) activity, which is in part removed by the subtraction procedure. One sees 
even more significant improvements in the cases with pileup, highlighting the importance of the use 
of some form of pileup subtraction. 

Let us now consider this more systematically. First, in figure [7] we examine the impact of 
subtraction without pileup for all algorithms for the 100 GeV $q\bar{q}$ and 2 TeV gg processes. This is 
given in terms of the effective luminosity ratios normalised as in figure [5], i.e. to the lowest (best) 
value of the $Q^{w}_{f=z}$ quality measure across all jet definitions without pileup (for brevity we omit 
results based on $Q^{1/f}_{w=x\sqrt{M}}$, which do not change the overall conclusions). The (red) solid curve 
is always the same as in figure [5] (i.e. for the jet algorithms without pileup or subtraction), while 
the (blue) dotted curve shows the results with subtraction. As expected, subtraction only matters 
significantly at large R. The algorithms that performed worst without subtraction are those that 
benefit the most from it, leading to quite similar optimal quality for all algorithms. 
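
The normalisation described above can be sketched as follows. The sketch assumes that the 
$Q^{w}_{f=z}$ measure is the width of the smallest mass window containing a fraction z of the generated 
events, and that the effective luminosity ratio of a jet definition is its quality measure divided 
by the best (lowest) value in the reference set. The function names and the toy inputs are 
hypothetical and serve only to illustrate the bookkeeping. 

```python
import math


def smallest_window_width(masses, n_generated, fraction):
    """Width of the narrowest mass window holding `fraction` of the generated events.

    `masses` holds the reconstructed masses of the events that passed selection, so it
    may contain fewer than `n_generated` entries. Returns infinity if too few events
    were reconstructed to reach the requested fraction.
    """
    needed = math.ceil(fraction * n_generated)
    if needed < 1:
        raise ValueError("fraction * n_generated must correspond to at least one event")
    ordered = sorted(masses)
    if needed > len(ordered):
        return math.inf
    # Slide a window of `needed` consecutive events over the sorted masses.
    return min(ordered[i + needed - 1] - ordered[i]
               for i in range(len(ordered) - needed + 1))


def luminosity_ratios(quality_by_definition):
    """Quality measures normalised to the best (lowest) value in the set."""
    best = min(quality_by_definition.values())
    return {name: q / best for name, q in quality_by_definition.items()}


if __name__ == "__main__":
    # Toy masses in GeV, invented purely for illustration.
    toy_masses = [90.0, 98.0, 99.0, 100.0, 101.0, 102.0, 110.0, 130.0]
    print(smallest_window_width(toy_masses, n_generated=8, fraction=0.5))
    print(luminosity_ratios({"definition A": 4.5, "definition B": 3.0}))
```

A ratio of 1 then marks the best jet definition in the reference set, and larger values indicate 
how much more integrated luminosity the other definitions would need under these assumptions. 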

Next, in figures [8] and [9] we show results respectively for low and high pileup. We maintain 
the same normalisation for the effective luminosity as before, and the (red) solid curves remain 
those of figure [5] throughout (no pileup, no subtraction). The (green) dashed curves show the 
effective luminosity ratios with unsubtracted pileup, while the (blue) dotted curves correspond to 
subtracted pileup. Unsubtracted pileup, unsurprisingly, degrades the quality in almost all cases, 
more so at larger R values. This R-dependence of the quality degradation causes the minima to 
shift to moderately smaller R values. Subtraction compensates for part of the loss of quality (and 
the shift in best R) due to pileup, though to an extent that varies according to the case at hand. 

For the purpose of this article, the main conclusion from figures [8] and [9] is that, if one chooses 
a jet-definition that is optimal in the case without pileup (i.e. based on the results of section 3), 
then in the presence of pileup (with subtraction) it gives an effective luminosity ratio $\rho_{\mathcal{L}}$ that 
remains within about 10% of the lowest possible value. Insofar as a given analysis may involve data 
taken at a range of instantaneous luminosities (which may even vary significantly over the lifetime 
of the beam), this is important since it implies that it will be satisfactory to choose a single 
jet-definition independently of the level of pileup. 

Figure 8: Illustration of the impact of low pileup, $0.05\,\mathrm{mb}^{-1}$ per bunch crossing. Luminosity ratios 
have been calculated based on the $Q^{w}_{f=z}$ measure, and normalised to the result for the best jet 
definition without pileup (no subtraction). The (red) solid curves show the result with no pileup 
and no subtraction, the (green) dashed curves have pileup without subtraction and the (blue) dotted 
curves have pileup and subtraction. 

5 Conclusions 



In this paper, we have examined the question of assessing the relative quality of a range of jet 
definitions for kinematic reconstructions. In contrast to other common approaches, we chose not 
to determine the quality of a jet definition in terms of how well its jets correspond to a given (but 
ill-defined) set of hard Monte-Carlo partons. Instead we used physically well-defined measures, i.e. 
the reconstruction of an invariant (dijet or top) mass peak. The quality of a given jet-definition is 
then related to the "sharpness" of the mass peak. Since sharpness is a somewhat fuzzy concept we 
introduced two "quality measures" to quantify the concept, independently of the peak shape, which 
is usually strongly non-Gaussian. With certain hypotheses, one can establish a proportionality 
relation between these quality measures and the amount of integrated luminosity needed to obtain 
a given statistical significance in a search. 

We studied the cases of narrow $q\bar{q}$ and gg resonances over mass scales ranging from 100 GeV 
to 4 TeV, as well as top and W reconstruction in $t\bar{t}$ events. We considered 5 jet algorithms, $k_t$, 
Cambridge/Aachen (C/A), anti-$k_t$, SISCone^13 and C/A with filtering, over jet-radii, R, spanning 
typically from 0.3 to 1.3. 

Our results (available in a more extensive form via the online tool [6]) tend to validate the 
existing widespread use of low radii, 0.4-0.5, in reconstructing quark-induced jets at mass scales of 
O(100 GeV). However, they also show that gluon-induced jets and high-scale jets prefer significantly 
larger R, up to an optimal value of R ~ 1.2 for the high-mass gg case. This general pattern coincides 
broadly with analytical expectations [20], and relates to an interplay between needing to capture 
perturbative radiation from the jet, while excluding underlying-event contamination. The former 
matters more at high scales and for gluon jets, hence the preference for larger R. 

A second pattern that emerges, relevant mainly at higher energy scales, is that among traditional 
(or "second generation," cf. footnote [8]) types of jet algorithm, SISCone often performs best and 
anti-$k_t$ performs better than other sequential recombination algorithms, $k_t$ and C/A. The 
third-generation C/A-filtering algorithm typically performs as well as SISCone, but prefers slightly larger 
R. The good performance of both SISCone and C/A-filtering can be traced back to their low sensitivity 
to underlying-event activity. 

A quantitative presentation of these results is given in figure [5] for a subset of processes, in 
terms of the extra factor in integrated luminosity that would be needed for a given jet definition to 
achieve the same significance as the optimal one in our set. An implication for LHC experiments 
planning to use R = 0.4-0.6 even in large-mass searches (see e.g. [21, 22]) is that some discoveries 
may then require up to twice as much integrated luminosity as would be the case with the optimal 
choice of jet-definition. 

Given such a statement, it is important to establish how it is affected by pileup. This was the 
subject of section 4. The conclusions are that the optimal jet definition without pileup remains close 
to optimal even with high-luminosity pileup (i.e. with ~ 25 pp interactions per bunch crossing), 
provided that adequate subtraction methods are used to correct the jets for the pileup. Subtraction 
also reduces the differences between jet algorithms, even in the absence of pileup, cf. figure [7]. 

Finally, given that the bulk of our results apply to dijet events, one may ask to what extent they 
hold in multi-jet situations. To investigate this, we have studied hadronic $t\bar{t}$ events and observed 
results similar to those for the low-mass $q\bar{q}$ case. However, we envisage that in multi-jet events 
at high mass scales there will be an additional tension between the need to resolve the separate 
jets and the need to include the bulk of perturbative gluon emission in the jet. The study of this 
issue is beyond the scope of this article, but we foresee that future developments in third-generation 
jet-methods can play an important role in optimising analyses of such events. 

^13 We recall that anti-$k_t$ is expected to behave similarly to iterative cones with progressive removal (like the current 
CMS iterative cone), while being infrared and collinear (IRC) safe. SISCone is an IRC safe cone algorithm with a 
split-merge step, and is therefore closer to iterative cone algorithms like the current ATLAS cone. 